A Markov Decision Process-based handicap system for tennis
نویسندگان
چکیده
منابع مشابه
A generalized Markov decision process
— In this paper we present a generalized Markov décision process that subsumes the traditional discounted, infinité horizon, finite state and action Markov décision process, VeinotCs discountéd décision processes, and Koehler's generalization of these two problem classes. Résumé. — Nous présentons dans cet article un processus de Markov généralisé qui englobe le processus de décision markovien ...
متن کاملQuantile Markov Decision Process
In this paper, we consider the problem of optimizing the quantiles of the cumulative rewards of Markov Decision Processes (MDP), to which we refers as Quantile Markov Decision Processes (QMDP). Traditionally, the goal of a Markov Decision Process (MDP) is to maximize expected cumulative reward over a defined horizon (possibly to be infinite). In many applications, however, a decision maker may ...
متن کاملOptimization for condition-based maintenance with semi-Markov decision process
The semi-Markov decision model is a powerful tool in analyzing sequential decision processes with random decision epochs. In this paper, we have built the semi-Markov decision process (SMDP) for the maintenance policy optimization of condition-based preventive maintenance problems, and have presented the approach for joint optimization of inspection rate and maintenance policy. Through numerica...
متن کاملControlling deliberation in a Markov decision process-based agent
Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision Process (MDP)-based scheduling agent. The agent’s reasoning process involves continuous partial unrolling of the MDP state space and periodic reprioritization of the states to be expanded. The meta-level controller ma...
متن کاملEvaluation of a Markov Decision Process-based Coordinated Sampling Method
The paper evaluates the use of Markov Decision Processes (MDP) as a framework for coordinated sensing and adaptive communication between distributed sensors. The technique enables distributed sensors to adapt their sampling rates in response to changing event criticality and the availability of resources (energy) at each node. The relationship between energy consumption, sampling rates, and uti...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Quantitative Analysis in Sports
سال: 2016
ISSN: 1559-0410,2194-6388
DOI: 10.1515/jqas-2016-0057